Evolution of Genetic Redundancy : 
The Relevance of Complexity in Genotype-Phenotype Mapping 
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Genetic redundancy is ubiquitous and can be found in any organism. However, it has been argued 
that genetic redundancy reduces total population fitness, and therefore, redundancy is unlikely to 
evolve. In this letter, we study an evolutionary model with high-dimensional genotype-phenotype 
mapping (GPM) to investigate the relevance of complexity in GPM to the evolution of genetic 
redundancy. By applying the replica method to deal with quenched randomness, the redundancy 
dependence of the fitness is analytically obtained, which demonstrates that genetic redundancy can 
indeed evolve, provided that the GPM is complex. Our result provides a novel insight into how 
genetic redundancy evolves. 
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All living organisms are under selection pressures, 
which act on their phenotypes, while only their genotypes 
are heritable. Therefore , the connection between geno- 
type and phenotype, referred to as genotype-phenotype 
mapping (GPM), is indispensable to fully understand 
evolutionary processes. However, GPM is generally com- 
plex and stochastic. Many phenotypic traits are now 
known to be a result of complex processes involving in- 
teractions between many proteins, RNAs, and genes. For 
example, developmental processes are largely regulated 
by transcriptional networks that are modeled by high- 
dimensional and non-linear equations [H 0] ■ Since such 
inherent complexity in GPM obscures which genotype is 
associated with a high-fitness phenotype, this complexity 
can have an impact on evolution. However, the relevance 
of GPM complexity to evolutionary processes has not 
been fully explored as yet. 

An important characteristic of GPM is genetic redun- 
dancy, i.e ., coding of a phenotypic trait by two or more 
genes. Numerous examples of genetic redundancy have 
been found in higher organisms 0,0] and even in microor- 
ganisms Q . A classical premise of evolutionary theory 
is that genetic redundancy lowers fitness at the popu- 
lation level, thus the redundancy would be evolutionar- 
ily suppressed For example, studies have shown 

that genes that have been duplicated lose its function in 
one of the redundant genes |10l4l3| . According to this 



argument, genetic redundancy reduces the sensitivity of 
fitness to mutation, and thus deleterious mutations may 
not be eliminated, leading to a decrease in the total fit- 
ness of the population. In contrast, individuals without 
genetic redundancy are generally more susceptible to the 
deleterious effects of mutations, so much so that most 
mutants are lethal and mutants with lower fitness are 
effectively removed, thus maintaining the high fitness of 
the population. This suggests that genetic redundancy is 
evolutionarily unstable. In particular, the suppression of 
redundancy is pronounced in asexual populations. Sev- 



eral studies have explored the conditions that enable the 
evolution of genetic redundancy, but it is still not well 
understood [a, 0, [l^ - flij . Specifically, the relevance of 
complexity in GPM to the evolution of genetic redun- 
dancy has not been evaluated. 

In the present Letter, we take an asexual evolution- 
ary model and study genetic redundancy to evolution, by 
comparing the results from simple and complex GPMs. 
Unlike the classical view mentioned above, we find that 
populations with higher genetic redundancy can have 
higher fitness under complex GPMs . This preference 
for genetic redundancy is characteristic of the high- 
dimensional complex GPM and is independent of previ- 
ously reported mechanisms [gI. IsI. H^ - flq . Therefore, this 
study will provide a novel explanation for the ubiquity of 
genetic redundancy in biological systems. 

In order to model both simple and complex GPM, we 
introduce an adiabatic spin system, where complex GPM 
is represented as a non-trivial mapping with quenched 
randomness. Indeed, Sakata et al. [131 adopted a spin sys- 
tem with quenched randomness to study evolution (not 
of genetic redundancy) under complex GPM. 

In this model, configuration of A'' sites of a locus is 
described by g = {gi, g2, ■ ■ ■ , gN), where each gi can take 
two different allelic states gi = ±1. Genotype g de- 
termines M phenotypic traits, which is represented by 
p = (pi,p2, ■ • • ,Pm), where each pi can take pi = ±1 
corresponding to whether the ith trait is expressed or 
not. Genetic redundancy is characterized by the param- 
eter 7 = N/M. For a given genotype g, phenotype p is 
determined in a stochastic manner whose probability de- 
pends on stochastic GPM P{p\g). We model this GPM 
by the following conditional probability. 



Pip\g) = exp(/3^ Jyp,5j)/Zp(g), 



(1) 



where determines the mapping from gj to pi and 
generally can be a quenched random variable. Here 
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= Tp is the temperature for the stochastieity in 
GPM, which represents the strength of phenotypic fluc- 
tuation in isogenic individuals, and Zp{g) is the partition 
function of p. The phenotype fluctuates during an indi- 
vidual lifetime due to developmental noise or environ- 
mental variation. These phenotypic fluctuations are suf- 
ficiently faster than the changes in genotype mediated by 
the evolutionary process of selection, reproduction, and 
mutation. Thus, the phenotypic fluctuation can be adia- 
batically eliminated. Hence, once g is obtained, one can 
assume that the distribution of p is uniquely determined 
by Eq. ([T|) . Changes in genotype distribution on an evolu- 
tionary time-scale are dominated by mutation and selec- 
tion processes under a given fitness function of the phe- 
notype p. Instead of introducing a complex fitness land- 
scape, we adopt a simple fitness function ^{p) = J2iPi^ 
to focus on the relevance of GPM to evolution. By the 
adiabatic assumption, only effective fitness (f>{g), the av- 
erage of fitness over p under given g, contributes 
to slower evolutionary dynamics of g. Distribution P{g) 
of genotype g in the population at equilibrium is deter- 
mined only by the effective fitness 4>{g) and "genotypic 
temperature" Tg, which represents the ratio of mutation 
rate to selection pressure. The distribution of genotypes 
is approximated by the Boltzmann distribution as 



P(g) = exp(/3>(g))/Zg, 



(2) 



where /?' = and Zg is a partition function of g. 

We study two extreme cases of GPM in order to 
compare the evolutionary steady states under complex 
and simple GPMs. As an example of simple GPM, 
Jij is chosen to be Jij ~ Jq/N without randomness, 
whereas an example of complex GPM is represented by 
quenched random variables Jij drawn from P(Jij) = 
exp [-jy2{ayN)]/^/2^^. 

First, we consider the behavior of simple GPM ^ 
Jq/N. In this case, the explicit form of effective fit- 
ness (j){g) can be calculated as (j){g) = Ti-pP{p\g)^{p) = 
Mtanh{l3JoJ2'j' gj/N). For sufficiently large N, Zg is 
thus obtained as 



Zg ~ exp 



N 
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where m is obtained from the following saddle point equa- 
tion. 
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The mean fitness per unit phenotype 7('/>) is given by 
= n"Sf^^^9 ~ tanh/?JoTO. Here, we adopt the fit- 
ness per unit phenotype 7 (</)), rather than the fitness per 
locus (0) in order to evaluate the contribution of genetic 



redundancy to fitness, because "f{4>) provides a natural 
measure to compare systems with high redundancy (large 
N) to ones with low redundancy (small A''), each with a 
fixed number of phenotypes M. In Fig. [T]-(a), Tg and Tp 
dependency of 7(0) for 7 = 1 is represented. This figure 
illustrates that j{(f)) decreases by the increase of either Tg 
or Tp, i.e., with the increase in mutation rate or pheno- 
typic fluctuation. Fig. [T]-(b) and (c) show 7 dependency 
of 'y{(t>) with fixed Tp and Tg, showing that ^{(j)) decreases 
as 7 increases. Since the 7 dependency of 7 (<?!>) appears 
in Eq. [S] with the form /3'/7, one may find that increas- 
ing 7 corresponds to the increase in Tg and thus leads to 
the decrease in fitness. This implies that more mutations 
tend to accumulate in the presence of redundant genes, 
which leads to a reduction in fitness. This is consistent 
with the suggestions that genetic redundancy weakens 
the effectiveness of selection in eliminating deleterious 
mutations in a population, thus causing the decline in 
population mean fitness [3t31- As a consequence of this 
accumulation of mutations, genetic redundancy enhances 
genetic diversity. This is represented by the increase in 
entropy per unit phenotype 7s = 7(logZc, — f5'{(f>)), as 
shown in Fig. [T]-(d). It follows from these results that, in 
the case of simple GPM, genetic redundancy 7 decreases 
fitness per unit phenotype 7(0) and increases entropy per 
unit phenotype 7s. 

Next, let us consider the case with a complex GPM, 
where Jij is a quenched random variable drawn from 
P(J,,) = exp [-jy2{a^jN)] /^2^{al/N). The ex- 
plicit form of effective fitness (t){g) can be calculated 

as (j){g) — Y^f ta.-nh{l3Y^^ Jijgj)- In this case, we 
need to calculate the expectation value of free energy 
over quenched random variables, rather than that of 
Zg. To calculate a quenched average of the free energy 



[F{Tg)]j = J F(Tg)dJP{J), the replica method [iSl is 
useful, where we first calculate [Zg]j for an integer n, 
and then [log Zg]j is obtained from 



(6) 



After some algebra, [Zg]j for an integer n is obtained as 

[Z]" ~ I + A^ (7-ilog*(g'^'^') + ^(9'"'',^''''')) , (7) 

where and are the spin-glass order parameter 
and its conjugate. Here, '^{q^^^^ ) and S{q'^'^ ,u!^^ ) are 
given by 

5(g^^',c.^^') = ilogTr{g}e^.>.' ^"s^sf^^^' (g) 



and 
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where rrii = {m\ . . .m^), fo{m^) = tanh/^m^ and Q is 
n X n matrix with diagonal elements, 1, and off-diagonal 
elements, q'^'^ . For sufficiently large TV, order parameters 
qi^i^ and w'^'^ in Eq. ([5]) and Eq. © are determined by 
the following saddle point equation: 



(a) I 



7" 



(10) 
(11) 



Here 
exp(X; 



indicates the average over the distribution 

5r3r'a;'^'^')/Tr,i...g™exp(E,.>,.,ffr5rw^^'). 
For further calculation, we assume replica symmetry 
(RS) ansatz as q'^'^ = q and o;^^ ~ uj for all [i ^ p! . 
This ansatz provides equations for q and a; as 



: J I?z (tanh(-y/wz)) , 



(12) 
(13) 



where 

cates 

tion 



Vz = e"^''/^dz/V27r and [-Je indi- 
taking an average over the distribu- 
cxp(0(to, z; /?'))/ J (iTOexp(0(m, z; /?')) with 
e(m, z; /?') = /^7o(to) - (m + y^z)V2cr2(l - q). For 
all the regions on the Tg-Tp plane, q is always positive 
except for when = oo or Tp = oo, indicating that the 
model has no paramagnetic phase. By using q and uj in 
Eq (fT2|) . free energy per locus for the RS solution, /rs, 
is calculated as 
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+ /^log/dm 



/ 2?z log 2 cosh (-y/w-z) 

e(n,,j;^') 



V27ra2(l-q))- 



The RS solution becomes invalid when the de Almeida- 
Thouless (AT) condition [l^ is broken, accompanied 
by replica symmetry breaking (RSB). Therefore we per- 
form Monte Carlo simulations (MCS) to estimate the fit- 
ness and entropy for the RSB phase, while theoretical es- 
timates based on RS ansatz are used in the RS phase. In 
Fig-HKa), the yellow dashed line shows the phase bound- 
ary between the RS and RSB phases on the Tg-Tp plane 
obtained by AT conditions and indicates that RSB occurs 
when either Tg or Tp are small. Details of the derivation 
of AT conditions and phase boundary are given in the 
Supplemental Materials. 

By differentiating — 7/3'//js with respect to /?', the 
mean fitness per unit phenotype of the RS solution, 7(1/)), 
is obtained as 



7(0) = / 2? 



/ dmtanh^m e®("'^''^') 
/ dm eQ(™'^^/'') 



(14) 



In the RS phase, the estimates obtained using an MCS 
agree with the above theoretical estimates. A color map 
in Fig. [l]-(a) shows Tp and Tg dependency of the fitness 
per unit phenotype 7(0) with fixed 7 and illustrates that 
decreases with either Tp or Tg increases. This de- 
crease of 7(0) against Tp and Tg is consistent with the 
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FIG. 1. (COLOR ONLINE) Fitness and entropy for simple 
GPM. (a) A color map of the mean fitness per phenotype 
7(0) for 7 = 1 is described on the Tp — Tg plane, (b) 7 
dependency of the mean fitness per phenotype 7(0) with fixed 
Tp = 1 and Tg = 0.1, 0.5, 1.0, and 1.5. (c) 7 dependency of the 
mean fitness per phenotype "/{4>) with fixed Tg = 1 and Tp = 
0.1,0.5, 1.0, and 1.5. (d) 7 dependency of the mean entropy 
per phenotype 7s with fixed Tp — 1 and Tg = 0.1,0.5, 1.0, 
and 1.5. In (a)-(d), the parameter ajg = 1 is used. 



results of the analysis of simple GPM. Redundancy de- 
pendence of 7((/)) with fixed Tp and Tg is shown in Fig.[2]- 
(b) and (c). In contrast to the case with simple GPM, 
"f{4>) increases with increasing 7. This result is opposite 
to the classical point of view, which implies that genetic 
redundancy decreases fitness at evolutionary equilibrium. 

Entropy per phenotype, 7s, is interpreted as the di- 
versity of genotypes in a population, and is calculated 
as 7s = — 7/3'((0) -I- fjis) for the RS phase. 7s is eval- 
uated for the RS and RSB phases num erically by using 
a multicanonical Monte Carlo method 




-22|, which is 

often used in spin glass systems and other fields [2^ [IJ] ■ 
Figure [21- (d) shows 7 dependency of 7s with fixed Tp and 
Tg. As shown in Fig. [2]-(d), 7s almost increases linearly 
with 7. This increase of 7s against 7 is similar to the 
case of simple GPM, where the presence of redundant 
genes allows for accumulation of mutations, and thus, an 
increase in the diversity of genotypes found in a popula- 
tion. To conclude, in the case of complex GPM, genetic 
redundancy 7 increases both fitness per unit phenotype, 
7(0), and entropy per unit phenotype, 7s. 

In the present study, an adiabatic spin model with 
high-dimensional GPM is investigated, where both the 
complexity in GPM (represented by random quenched 
variables Jy ) and genetic redundancy (7) are controllable 
parameters. We compared evolutionary steady states be- 
tween for simple and complex GPM cases. In the case of 
simple GPM, fitness per unit phenotype 7((/)) decreases 
as 7 increases (Fig. [T]-(b) and (c)), indicating that redun- 
dancy should be suppressed by selection pressure, which 
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FIG. 2. (COLOR ONLINE) Fitness and entropy for complex 
GPM. (a) A color map of mean fitness per phenotype 7(1^) 
for 7 = 1 is described on the Tp-Tg plane. The yellow dashed 
line indicates the phase boundary between RS and RSB de- 
termined by the condition AT2(a), which is described in the 
Supplemental Materials. Below the line, the system is in the 
RSB phase, (b) 7 dependency of mean fitness per phenotype 
7((/>) with fixed Tp = 1 and Tg = 0.1,0.5,1.0 and 1.5. (c) 
7 dependency of mean fitness per phenotype 7(9^) with fixed 
Tg = 1 and Tp = 0.1,0.5,1.0 and 1.5. (d) 7 dependency 
of mean entropy per phenotype 7s with fixed Tp — 1 and 
Tg — 0.1,0.5,1.0 and 1.5. In (a)-(d), the parameter aj = 1 
is used. In (b)-(d), thick lines indicate theoretical estimates 
from the RS ansatz and large circles indicate the transition 
point given by the AT2(a) condition. Thin dashed lines rep- 
resent invalid estimates from the RS ansatz. In (b) and (c), 
dots indicate estimates by MCS over 100 random realizations 
of J for A'^ = 50 using the Metropolis algorithm [2^. In (d), 
dots indicate estimates by MCS over 10 random realizations 
of J for N = 50 using the multicanonical algorithm [20l - [2^ . 



is consistent with the classical view ■ As is shown in 
Fig. [I]-(d), the decrease in redundancy reduces entropy 
per unit phenotype 7s. Since less genetic diversity in the 
population hinders accessibility to a novel genotype and 
suppresses evolvability, selection pressure toward less ge- 
netic redundancy for simple GPM will lead a population 
to an evolutionary dead-end. 

Remarkably, this is not true for a complex GPM. 
Under complex GPM, a population with higher 7 ex- 
hibits higher fitness as shown in Fig.[2]-(b) and (c). 
Therefore, genetic redundancy, i.e., a system with higher 
7 = N/AI can evolve, which is contrary to the classical 
view of genetic redundancy. A possible explanation is 
as follows: Larger N provide greater degrees of freedom 
in g to optimize the fitness, and thus enables realization 
of g that provides higher fitness than the highest fitness 
found in systems with smaller N. At the same time, a 
system with smaller M decreases the variety of connec- 
tions from gi to pi, and diminishes frustrations in gi to 
optimize the fitness. Therefore a system with larger N 



and smaller M can provide higher fitness. 

As is shown in Fig. [2]-(d), larger 7 accompanies in- 
creases in entropy per unit phenotype 7s, and thus, in 
growth in genetic diversity. This allows a population to 
access a variety of novel genotypes, which could accel- 
erate the emergence of evolutionary innovations. Thus, 
a population with complex GPM can have evolvability. 
It is interesting to point out that it has been suggested 
that genetic diversity enables a population to rapidly re- 
spond to large environmental changes 16, 2^ 27 1, thus 
our results here also suggest that a population with com- 
plex GPM could have potential ability to adapt to new 
environments. 

The model with complex GPM exhibits RS/RSB tran- 
sition. Interestingly, such a transition was also reported 
in a different model by Sakata et al [l7[, which claims 
that the RSB phase is biologically unfavorable. Simi- 
lar to their study, we also demonstrate that phenotypic 
fluctuations suppress the appearance of the RSB phase. 
This RS/RSB transition can appear in many evolution- 
ary models with complex GPM, and a better understand- 
ing of the biological significance of this transition is wor- 
thy of further investigation. 

In summary, we have demonstrated how complexity 
in GPM promotes evolution of genetic redundancy. The 
mechanism is general, and the recombination process in 
a sexual population does not weaken the proposed pref- 
erences in genetic redundancy. Selection processes with 
complex GPM make gene duplication preferable, which 
can further enhance the potential of emerging novelty in 
evolution. 
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